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A class of binary signaling alphabets called "group alphabets" is de- 
scribed. The alphabets are generalizations of Hamming's error correcting 
codes and possess the following special features: (1) all letters are treated 
alike in transmission; (2) the encoding is simple to instrument; (3) maxi- 
mum likelihood detection is relatively simple to instrument; and (4) in 
certain practical cases there exist no better alphabets. A compilation is given 
of group alphabets of length equal to or less than 1 binary digits. 

INTRODUCTION 

This paper is concerned with a class of signaling alphabets, called 
"group alphabets," for use on the symmetric binary channel. The class 
in question is sufficiently broad to include the error correcting codes of 
Hamming,' the Reed-MuUer codes,^ and fi^ "systematic codes''.^ On 
the other hand, because they constitute a rather small subclass of the 
class of all binary alphabets, group alphabets possess many important 
special features of practical interest. 

In particular, (1) all letters of the alphabets are treated alike under 
transmission; (2) the encoding scheme is particularly simple to instru- 
ment; (3) the decoder — a maximum likelihood detector — is the best 
possible theoretically and is relatively easy to instrument; and (4) in 
certain cases of practical interest the alphabets are the best possible 
theoretically. 

It has very recently been proved by Peter Elias* that there exist group 
alphabets which signal at a rate arbitarily close to the capacity, C, of 
the symmetric binary channel with an arbitrarily small probability of 
error. Elias' demonstration is an existence proof in that it does not 
show explicitly how to construct a group alphabet signaling at a rate 
greater than C — e with a probability of error less than 5 for arbitrary 
positive 6 and e. Unfortunately, in this respe(!t and in many others, our 
understanding of group alphabets is still fragmentaiy. 

In Part I, group alphabets are defined along with some related con- 
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cepts necessary for their understaiiding. The main results obtained up 
to the present time are stated without proof. Examples of these concepts 
are given and a compilation of the best group alphabets of small size 
is presented and explained. This section is intended for the casual reader. 

In Part II, proofs of the statements of Part I are given along with 
such theory as is needed for these proofs. 

The reader is assumed to be familiar with the paper of Hamming, 
the basic papers of Shannon^ and the most elementary notions of the 
theory of finite groups. 

Part I — Group Alphabets and Their Properties 

1.1 INTRODUCTION 

We shall be concerned in all that follows with communication over the 
symmetric binary (channel shown on Fig. 1. The channel can accept 
either of the two symbols or 1. A transmitted is received as a with 
probability q and is received as a 1 with probability p = I - q-.a trans- 
mitted 1 is received as a 1 with probability q and is received as a with 
probability p. We assume g p g hi. The "noise" on the channel 
operates independently on each symbol presented for transmission. The 
capacity of this channel is 

C = 1 + p logap + q log2(7 bits/symbol (1) 

By a K-letter, n-place binary signaling alphabet we shall mean a collec- 
tion of K distinct sequences of n binary digits. An individual sequence 
of the collection will be referred to as a letter of the alphabet. The integer 
K is called the size of the alphabet. A letter is transmitted over the 
channel by presenting in order to the channel input the sequence of n 
zeros and ones that comprise the letter. A detection scheme or detector for 




INPUT X: °^^^^^ 

T 

Fig. 1 — The symmetric binary channel. 
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a given /il -letter, n-place alphabet is a procedure for producing a sequence 
of letters of the alphabet from the channel output. 

Throughout this paper we shall assume that signaling is accomplished 
with a given K-lcttar, n-place alphabet by choosing the letters of the 
alphabet for transmission independently with equal probability 1/K. 

Shannon^ has shown that for sufficiently large n, there exist iC-letter, 
n-place alphabets and detection schemes that signal over the symmetric 
binary channel at a rate R > C — e for arbitrary £ > and such that 
the probability of error in the letters of the detector output is less than 
any 5 > 0. Here C is given by (1) and is shown as a function of p in 
Fig. 2. No algorithm is known (other than exhaustvie procedures) for 
the consti'uction of K-letier, n-place alphabets satisfying the above 
inequalities for arbitrary positive 5 and s except in the trivial cases C = 
and C = 1. 

1.2 THE GROUP 5„ 

There are a totality of 2" different n-place binary sequences. It is fre- 
quently convenient to consider these sequences as the vertices of a cube 
of unit edge in a Euclidean space of n-dimensions. For example the 5- 
place sequence 0, 1 , 0, 0, 1 is associated with the point iu 5-space whose 




0.4 0.5 



Fig. 2 — The capacity of the sjTnmetric binary channel. 
C = 1 + p log2 p + (1 - P) logs (1 - V) 



206 



THE BELL SYSTEM TECHNICAL JOURNAL, JANUARY 1956 



coordinates are (0, 1, 0, 0, 1). For convenience of notation we shall gen- 
erally omit commas in writing a sequence. The above 5-place sequence 
will be written, for example, 01001. 

We define the product of two n-place binanj sequences, aitti ■ ■ ■ a^ and 
foi^2 ■ ■ • hn as the n-placc binary sequence 



«i + &i , 



, On + &n 



Here the a's and (j's are zero or one and the + sign means addition 
modulo 2. (That is + 0=1 + 1=0, 0+1-1+0=1) 
For example, (01101) (00111) = 01010. With this rule of multiplication 
the 2" n-place binary sequences form an Abelian group of order 2". 
The elements of the group, denoted by 'l\, T2, ■•• , Tin, say, are the 
n-place binary sequences; the identity element I is the sequence 000 • ■ ■ 
and 

n\ = Til = Ti ; TiT,- = TjTi ; TiiTjT,) - {TiTj)n ; 

the product of any number of elements is again an element; every ele- 
ment is its own reciprocal, Ti = Ti~\ Tl - /. We denote this group 
byS„. 

All subgroups of -B„ are of order 2*^ where k is an integer from the set 

0, 1, 2, • ■ ■ ,n. There are exactly 

( 2" - 2°) (2" - 2^) (2" - 2') ■ ■ ■ (2" - 2^"') 

W^^^^) (2) 



A^(^ h) = 



(2^ _ 2'>)(2'^ - 20(2*= - 2^) 
= N(n, n — k) 

distinct subgroups of Bn of order 2*. Some values of N{n, k) are given in 
Table I. 



Table I — Some Values of ^^(n, fc), the Number of Subgroups 

OF Bn OF Order 2^. N(n, k) = N{n, n — k) 



»\k 





1 


2 


3 


-! 


5 


2 




3 


1 








3 




7 


7 


1 






4 




15 


35 


15 


1 




6 




31 


155 


155 


31 


1 


6 




63 


651 


1395 


651 


63 


7 




127 


2667 


11811 


11811 


2667 


8 




255 


10795 


97155 


200787 


97155 


9 




511 


43435 


788035 


3309747 


3309747 


10 




1023 


174251 


6347715 


53743987 


109221651 



(i) 


(ii) 


(iii) 


(iv) 


(v) 


(vi) 


(vii) 


000 


000 


000 


000 


000 


000 


000 


100 


100 


100 


010 


010 


001 


no 


010 


001 


on 


001 


101 


no 


on 


110 


101 


111 


on 


111 


111 


101 
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1.3 GROUP ALPHABETS 

An n-p\ace group alphabet is a /C-letter, n-place biiiaiy signaling alpha- 
bet whose letters form a subgroup of B„ . Of necessity the size of an 
n-place group alphabet is K = 2'' where fc is an integer satisfying ^ 
k ^ 71. By an (?i, k)-alphabd we shall mean an n-place group alphabet of 
size 2*". P^xample: the A''(3, 2) = 7 distinct (3, 2)-alphabets are given by 
the seven columns 



(3) 



1.4 STANDARD ARRAYS 

Let the letters of a specific (n, fc)-alphabet be ^i ^ / = 00 ■ ■ ■ 0, 
Ai, Ai, ' ■ ■ ,A^, where m = 2*". The group -B„ can be developed accord- 
ing to this subgroup and its cosets: 

7, A2, A3 , ••■ ,A^ 

S2 , S2A2 , S^As t ■ ■ ' , S2Af, 

S3 , S3A2 , SsAs , ■ ■ " , OaAp 

S, , S,A2 , SvAs , • • • , S,Ap 

H = 2 , V = 2. 

In this array every element of 5„ appears once and only once. The col- 
lection of elements in any row of this array is called a cosei of the (n, h)- 
alphabet. Here S2 is any element of B^ not in the first row of the array, 
Sz is any element of S„ not in the first two rows of the array, etc. The 
elements Si, S3 , • • ■ , S, appearing under I in such an array will be 
called the coset leaders. 

If a coset leader is replaced by any element in the coset, the same coset 
will result. That is to say the two collections of elements 

Si , SiA2 , 0,03 J " " ■ , SiA^ 

and 

SiA, , {SMA2 , (SiA,)A, ,■■■ {SiAk)A, 

are the same. 
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We define the weight Wi = w(Ti) of an element, T, , of B^ to be the 
number of ones in the n-place binary sequence T, . 

Henceforth, unless otherwise stated, we agree in dealing with an ar- 
ray such as (4) to adopt the following convention : 

the leader of each coset shall be taken to he an . ■. 

ele7nent of minimal weight in that coset. 

Such a table will be called a standard array. 

Example: Bt can be developed according to the (4, 2)-alphabet 0000, 
1100, 0011, nil as follows 



0000 1100 0011 nil 

1010 0110 1001 0101 

1110 0010 1101 0001 

1000 0100 1011 0111 

According to (5), however, we should write, for example 



(6) 



0000 


1100 


0011 


nil 


1010 


Olio 


1001 


0101 


0010 


1110 


0001 


1101 


1000 


0100 


1011 


0111 



(7) 



The coset leader of the second coset of (6) can be taken as any element 
of that row since all are of weight 2. The leader of the third coset, how- 
ever, should be either 0010 or 0001 since these are of weight one. The 
leader of the fourth coset should be either 1000 or 0100. 

1.6 THE DETECTION SCHEME 

Consider now communicating with an (n, /c)-alphabet over the sym- 
metric binary channel. When any letter, say Aj , of the alphabet is 
transmitted, the received sequence can be of any element of B„ . We 
agree to use the following detector: 

if the received element of B„ lies in column i of the array (4) , the 

detector prints the letter Ai ,i = 1, 2, ■ ■ ■ , m- The array (4) is to (8) 

be constructed according to the convention (5). 

The following propositions and theorems can be proved concerning 
signaling with an (n, /j)-alphabet and the detection scheme given by (8). 

1.6 BEST DETECTOR AND S'i'MMETRIC SIGNALING 

Define the probability fj = f(Ti) of an element Ti of B„ to be /,- = 
pU-i^n-u-i ^jjgpg p ^jjj q ^^g 03 jjj ^j Qj^^ y,. jg ^|jg weight of Ti . Let 
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Qi , i = I, 2, ■ • ■ , fihe the sum of the probabiUties of the elements in 
the ith column of the standard array (4). 

Proposition 1. The probability that any transmitted letter of the 
(n, fc)-alphabet be produced correctly by the detector is Qi . 

Proposition 2. The equivocation^ per symbol is 

Hy{x) = -ii:Qilog2Q.- 

n ,=1 

Theorem 1. The detector (8) is a maximum likelihood detector. That 
is, for the given alphabet no other detection scheme has a greater average 
probability that a transmitted letter be produced correctly by the de- 
tector. 

Let us return to the geometrical picture of n-place binary sequences 
as vertices of a unit cube in n-space. The choice of a if -letter, n-place 
alphabet corresponds to designating K particular vertices as letters. 
Since the binary sequence corresponding to any vertex can be produced 
by the channel output, any detector must consist of a set of rules that 
associates various vertices of the cube with the vertices designated as 
letters of the alphabet. We assume that eveiy vertex is associated with 
some letter. The vertices of the cube are divided then into disjoint sets, 
Wi , W2 , • • ■ , Wk where W,- is the set of vertices associated with ith 
letter of the signaling alphabet. A maximum hkelihood detector is char- 
acterized by the fact that every vertex in Wi is as close to or closer to 
the ith letter than to any other letter, z = 1, 2, ■ ■ ■ , Tf. For group alpha- 
bets and the detector (8), this means that no element in the ith column 
of array (4) is closer to any other A than it is to J.; , « = 1, 2, ■ ■ ■ , ju. 

Theorem 2. Associated with each (n, /i-)-alphabet considered as a point 
configuration in Euclidean n-space, there is a group of n X n orthogonal 
matrices which is transitive on the letters of the alphabet and which 
leaves the unit cube invariant. The maximum likelihood sets W\ , 
VF2 , • ■ ■ Tf p are all geometrically similar. 

Stated in loose terms, this theorem asserts that in an (/i, fc)-alphabet 
every letter is treated the same. Every two letters have the same number 
of nearest neighbors associated with them, the same number of next 
nearest neighbors, etc. The disposition of points in any two W regions 
is the same. 

1.7 GROUP ALPHABETS AND PARITY CHECKS 

Theorem 3. Every group alphabet is a systematic code: every syste- 
matic code is a group alphabet. 
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We prefer to use the word "alphabet" in place of "code" since the 
latter has many meanings. In a systematic alphabet, the places in any 
letter can be divided into two classes : the information places — k in 
number for an (n, /i;)-alphabet — and the check positions. All letters 
have the same information places and the same check places. If there 
are k information places, these may be occupied by any of the 2^ /c-placc 
binary sequences. The entries in the n — h check positions are fixed 
linear (mod 2) combinations of the entries in the information positions. 
The rules by which the entries in the check places are determined are 
called 'parity checks. Examples: for the (4, 2)-alphabet of (6), namely 
0000, 1100, 0011, 1111, positions 2 and 3 can be regarded as the informa- 
tion positions. If a letter of the alphabet is the sequence aiazaztti , then 
ai = a2 , 04 = 03 are the parity checks determining the check places 1 
and 4. For the (5, 3)-alphabet 00000? 10001, OlOU, 00111, 11010, 10110, 
01100, 11101 places 1, 2, and 3 (numbered from the left) can be taken 
as the information places. If a general letter of the alphabet is aiflaflsQias , 
then a4 = 02 -t- fla , Ob = «i + 02 + tta - 

Two group alphabets are called equivalent if one can be obtained from 
the other by a permutation of places. Example: the 7 distinct (3, 2)- 
alphabets given in (3) separate into three equivalence classes. Alpha- 
bets (i), (ii), and (iv) are equivalent; alphabets (iii), (v), (vi), are equiva- 
lent; (vii) is in a class by itself. 

Proposition S. Equivalent (n, fc)-alphabets have the same probability 
Qi of correct transmission for each letter. 

Proposition 4- Every (n, /c)-alphabet is equivalent to an (n, k)- 
alphabet whose first k places are information places and whose last n — k 
places are determined by parity checks over the first k places. 

Henceforth we shall be concerned only with {n, fc) -alphabets whose 
first k places are information places. The parity check rules can then 
be written 

k 
ai = £ yij^j , i = k + I, ■ ■ • , n (9) 

where the sums are of course mod 2. Here, as before, a typical letter of 
the alphabet is the sequence oiOa ■ ■ ■ o„ . The jij are k(n — k) quantities, 
zero or one, that serve to define the particular (n, /i;)-alphabet in question . 

1.8 MAXIMUM LIKELlkoon UETECTION BY PARITY CHECKS 

For any element, T, of Bn we can form the sum given on the right of 
(9). This sum may or may not agree with the symbol in the zth place of 
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T. If it does, we say T satisfies the tth-place parity check; otherwise T 
fails the ith-place parity check. When a set of parity check rules (9) is 
given, we can associate an (n — fc)-place binary sequence, R{T), with 
each element T of 5„. We examine each cheek place of T in order starting 
with the {k + l)-st place of T. We write a zero if a place of T satisfies 
the parity check; we write a one if a place fails the parity check. The re- 
sultant sequence of zeros and ones, written from left to right is R{T). 
We call R{T) the -parity check sequence of T. Example: with the parity 
rules a4 ^ 02 -j- fl3 , as = ai -j- 02 + «3 used to define the (5, 3)-alphabet 
in the examples of Theorem 3, we find /^(llOOO) = 10 since the sum of 
the entries in the second and third places of UOOl is not the entry of 
the fourth place and since the sum of ai = 1, 02 = 1, and 03 = is 
= as . 

Theorem 4. Let /, ^2 , ■ ■ ■ A^ be an (n, ^)-alphabet. Let R{T) be the 
parity check sequence of an element T of Bn formed in accordance with 
the parity check rules of the {n, /o)-alphabet. Then R{Ti) = R{T^) if 
and only if 7'i and T^ lie in the same row of array (4). The coset leaders 
can be ordered so that R{Si) is the binary symbol for the integer i — 1. 

As an example of Theorem 4 consider the (4, 2)-alphabet shown with 
its cosets below 

0000 1011 Old 1110 

0100 1111 0001 1010 

0010 1001 0111 1100 

1000 0011 1101 Olio 

The parity check rules for this alphabet are as = ai , 04 = ai + ^2 ■ 
Every element of the second row of this array satisfies the parity check 
in the third place and fails the parity check in the 4th place. The parity 
check sequence for the second row is 01. The parity check for the third 
row is 10, and for the fourth row 11. Since every letter of the alphabet 
satisfies the parity checks, the parity check sequence for the first row is 
00. We therefore make the following association between parity check 
sequences and coset leaders 



00 - 


^ 0000 = Si 


01 - 


^ 0100 = s^ 


10 - 


-> 0010 = Si 


11 - 


-> 1000 = 54 



1.9 INSTRUMENTING A GROUP ALPHABET 

Proposition 4 attests to the ease of the encoding operation involved 
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with the use of an (n, /k) -alphabet. If the original message is presented as 
a long sequence of zeros and ones, the sequence is broken into blocks of 
length k places. Each block is used as the first k places of a letter of 
the signaling alphabet. The last n-k places of the letter are determined 
by fixed parity checks over the first k places. 

Theorem 4 demonstrates the relative ease of instrumenting the maxi- 
mum hkelihood detector (8) for use with an (n, A;)-alphabet. When an 
element T of 5„ is received at the channel output, it is subjected to the 
n-k parity checks of the alphabet being used. This results in a parity 
check sequence R(T). R{T) serves to identify a unique coset leader, say 
Si . The product SiT is then formed and produced as the detector out- 
put. The probability that this be the correct letter of the alphabet is Qi . 

1.10 BEST GHOUP ALPHABETS 

Two important questions regarding {n, fc)-alphabets naturally arise. 
What is the maximum value of Qi possible for a given n and k and which 
of the A^(?i, k) different subgroups give rise to this maximum Qi? The 
answers to these questions for general n and k are not known. For many 
special values of n and k the answers are known. They are presented in 
Tables II, III and IV, which are explained below. 

The probability Qi that a transmitted letter be produced correctly by 
the detector is the sum, f^i = ^i i(Si) of the probabilities of the coset 
leaders. This sum can be rewritten as Qi = 2li=o on p^g" ' where «:,■ is 
the number of coset leaders of weight i. One has, of course, ^at = v = 
nn-k - ' ' /n\ nl 



for an (n, fc)-alphabet. Also a; ^ ( . 1 = ttt— ^ — rr ! since this is the 

\i / t\{n — t) 

number of elements of B„ of weight i. 

The ai have a special physical significance. Due to the noise on the 
channel, a transmitted letter, Ai , of an (n, /v)-alphabet -will in general be 
received at the channel output as some element T of B„ different from 
Ai . If T differs from .4,- in s places, i.e., if w(A,T) = s, we say that an 
s-tuple error has occurred. For a given {n, Z;)-alphabet, ai is the number 
of i-tuple errors which can be corrected by the alphabet in question, 
i = 0,1,2, ■•• ,n. 

Table II gives the cti corresponding to the largest possible value of Qi 
for a given k and n, for fc = 2, 3, ■ ■ ■ w. — 1, n = 4 • • ■ , 10 along with a 
few other scattered values of n and k. For reference the binomial coeffi- 
cients! . J are also listed. For example, we find from Table II that the 
best group alphabet with 2 =^ IG letters that uses n = 10 places has a 
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probability of conect transmission Qi = g^'^ + lO^^p + 39gV + 1-lgV^- 
The alphabet corrects all 10 possible single errors. It corrects 39 of the 

possible ( ^ j = 45 double errors (second column of Table II) and in 

addition corrects 14 of the 120 possible triple errors. By adding an addi- 
tional place to the alphabet one obtains with the best (11, 4)-alphabet 
an alphabet with 10 letters that corrects all 11 possible single errors and 
all 55 possible double errors as well as 61 triple errors. Such an alphabet 
might be useful in a computer representing decimal numbers in binaiy 
form. 

For each set of a's listed in Table II, there is in Table III a set of 
parity check rules which determines an (/i, A)-alphabet having the given 
a's. The notation used in Table III is best explained by an example. A 
(10, 4)-alphabet which realizes the a's discussed in the preceding para- 
graph can be obtained as follows. Places 1, 2, 3, 4 carry the information. 
Place 5 is determined to make the mod 2 sum of the entries in places 
3, 4, and 5 e(|ual to zeio. Place 6 is determined by a similar parity check 
oil places 1, 2, 3, and 6; place 7 by a check on places 1, 2, 4, and 7, etc. 

It is a surprising fact that for all cases in\-estigated thus far an (ii, k)- 
alphabet best for a given value of p is uniformly best for all values of 
p, ^ p ^ ) 2- It is of course conjectured that this is true for all n and k. 

It is a further (perhaps) surprising fact that the best (n, ^')-alphabots 
are not neces.sarily those with greatest nearest neighbor distance be- 
tween letters when the alphabets are regarded as point configurations on 
the Ji-cnbe. For example, in the best (7, 3)-alphabet as listed in Table 
111, each letter has two nearest neighbors distant 3 edges away. On the 
other hand, \n the (7, 3)-alphabet givcLi by the parity check rules 413, 
512, (123, 7123 each letter has its nearest neighbors 4 edges away. This 
latter alphabet does not have as large a value of Qi , however, as does 
the (7, 3)-alphabet listed on Table III. 

The cases k = 0, l,n — 1, n have not been listed in Tables II and III. 
The cases /; = and /; = n are completely trivial. For /,' = 1, all n > 1 
the best alphabet is obtained using the parity rule n-z = a^ = ■ ■ ■ = 
«„ = Oi . If ?; = 2j, 

<h - % (:) v'<r + \ (;;) vw. n « ^ 2i + , <?. = t (^) ^v 

For k = n — \, n > 1, the maximum Q\ is Qi = q"~^ and a parity rule 
for an alphabet realizing this Q\ is a„ = ai . 

If the a's of an (/), /.'j-alphabet are of the form «, ^ I . J , ;' = 0, 1, 



n— I 
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Table II — Probability of No Error with Best 
Alph.\bets, Qi ^ 2I f^iVW 







(?) 


i = 2 


k = i 


k = i 


i = 5 


k = 6 


k = 7 


k = B 


i = g 


* - 10 




i 


ii' 


ii 


Oj 


Of 


Oi 


<i, 


Of 


"i' 


"i 


« = 4 




1 




1 
4 

1 


1 

3 

1 


I 
















71 = 5 


1 

2 




5 
10 

1 


5 
2 

1 


3 
I 


1 














71 = 6 


1 

2 



15 


6 

9 


6 
1 


3 



















1 


1 


1 


1 
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n = 1 


1 
2 
3 


7 
21 
25 


7 

18 
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7 
8 


7 


3 
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1 
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1 


1 


1 










ji. = 8 


1 
2 
3 


8 
28 
56 


8 
28 

27 


8 

20 

3 


8 

7 


7 


3 
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1 


1 


1 


1 


1 


1 










1 


<) 


9 


9 


9 


9 


7 


3 








71 = 9 


2 
3 
4 


36 

84 

126 


36 
64 

18 


33 
21 


22 


G 

















1 


1 


1 


1 


1 


1 


1 


1 








1 


10 


10 


10 


10 


10 


10 


7 


3 






71 = 10 


2 

3 
4 


45 
120 
210 


45 

110 

90 


45 

G4 

8 


39 
14 


21 


5 















1 


1 


1 


1 




1 


1 


1 


1 






1 


11 


11 


11 


11 




11 


11 


7 


3 




11 = 11 


2 
3 
4 
5 


55 
165 
330 
462 


55 
165 
226 

54 


55 

126 

63 


55 
61 




20 


4 
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1 
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1 


1 


1 




1 


12 


12 


12 








12 


12 


7 


3 


71 = 12 


. 2 

3 
4 
5 


66 

220 
495 
702 


66 
220 
425 
300 


66 
200 
233 








19 


3 




■ -if' 
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2, ■ ■ ■ , i, «j+i = '■ some integer, ay+n = oij+g = - ■ ■ = a„ = 0, then 
there does not exist a 2Metter, n-place alphabet of any sort better than 
the given (n, /f) -alphabet. It will be observed that many of the a's of 
Table II are of this form. It can be shown that 

Proposition 5 if n + ^" ~ 'M' + ('^ 3 'M ^ 2"-' - 1 there exists 

no 2''-Ietter, n-place alphabet better than the best (n, /c)-alphabet. 
When the inequality of proposition 5 holds the a's are either ao = 1, 

«i = 2""' - ] , all other a = 0; or a^ = 1, ai = (^j , a^ = 2"'' - 1 - 
all other a = 0; or the trivial a^ = I all other a = which holds 



1 

when k = n. The region of the n — k plane for which it is known that 
(n, /i:)-alphabets cannot be excelled by any other is shown in Table IV. 

1.11 A DETAILED EXAMPLE 

As an example of the use of (n, A)-alphabets consider the not un- 
realistic case of a channel with p = 0.001, i.e., on the average one binary 
digit per thousand is received incorrectly. Suppose we wish to transmit 
messages using 32 different letters. If we encode the letters into the 32 
5-place binary sequences and transmit these sequences without further 
encoding, the probability that a received letter be in error is 1 — 
(1 - p)^ = 0.00449. If the best (10, 5)-alphabet as shown in Tables 11 
and III is used, the probability that a letter be wrong is 1 — Qi = 
1 _ 5^" _ lOq^p - 21 gV = 'lip' - 12p' + ■■■ - 0.000024. Thus 
by reducing the signaling rate by Vz, a more than one hundredfold re- 
duction m probabihty of error is accomplished. 

A (10, 5)-alphabet to achieve these results is given in Table III. Let 
a typical letter of the alphabet be the 10-place sequence of binary digits 
ttifla ■ ■ - Qgflio . The symbols aia2«3a4a5 carry the information and can be 
any of 32 different arrangements of zeros and ones. The remaining places 
are determined by 

06 = «! + 03 + (^4 + «5 

07 = fli -j- fla -j- ^^4 -j- tta 
Os = (h -j- 0-2 -j- as + Ob 
flg ^ ai -]- 02 + a-i -j- tti 

nil, = Ox -j- <^-2 + «3 -j- a-A -j- Ob 

To design the detector for this alphabet, it is first necessary to deter- 
mine the cosot leaders for a standard array (4) formed for this alphabet. 
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Table IV — Region of the n-k Plane for Which it is Known 
THAT (n, fc) -Alphabets Cannot Be Excelled 



k 
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28 

27 
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g 

8 
7 
6 

5 
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3 
2 
1 . 



12345678910 12 14 16 18 20 22 24 26 28 30 n 



This can be done by a variety of special methods which considerably 
reduce the obvious labor of making such an array. A set of best S's along 
with their parity check symbols is given in Table V. 

A maximum likelihood detector for the (10, 5)-alphabet in question 
forms from each received sequence ?>ib2 ■ ■ ■ 6io the parity check symbol 
C1CSC3C4C6 where 

Ci = hs -i- ^Ji -i- ^3 -]- ^4 -i- h 

C2 = bj -i- bi -i- ^2 + &4 + ^5 

C3 = bs -j- &i + ?'2 + bs -i- &5 

Ci = h + bi + ^^2 + b, -i- hi 

cb = b,o ^bi^h-2 -I- b, -i- Ih + b. 

According to Table V, if CiCzCsCiC^ contains loss than three ones, the de- 
tector should brint hih-Mubf, . The detector should print (bi -j- Dh^hbth 
if the parity check sequence C1C2C3C4CB is either 11111 or 11110; the de- 
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Table V — Coset Leaders and Parity Check Sequences 
FOR (10,5)-Alphabet 
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tector should print i>i(b2 -\- l)&3'>4bs if the parity check sequence is 01111, 
00111, 01011, 01101, or OHIO; the detector should print hMbs + l)bih 
if the parity check sequence is 10111, 10011, 10101, or 10110; the de- 
tector should print W>563(''4 -f l)65if the parity check sequence is 11011, 
11001, 11010; and finally the detector should print bihihhiibi -j- 1) if the 
parity check sequence is 11101 or 11100. 

Simpler rules of operation for the detector may possibly be obtained 
by choice of a different set of S's in Table V. These quantities in general 
are not unique. Also there may exist non-equivalent alphabets with 
simpler detector rules that achieve the same probability of error as the 
alphabet in question. 

Part II — Additional Theory and Proofs of Theorems of Part I 

2.1 THE ABSTRACT GROUP C„ 

It will be helpful here to say a few more \\ords about Bn , the group 
of n-place binary sequences under the operation of addition mod 2. This 
group in simply isomorphic with the abstract group C„ generated by n 
commuting elements of order two, say ai, 03 , ■ ■ , «„ . Here aiaj = 
ajQi and «/' = I, i, j = \, 2, ■ ■ ■ , n, where / is the identity for the 
group. The eight distinct elements of C3 are, for example, /, Oi , a-2 , 
az , aiQa , aiQs , a-ia^ , aia^a^ . The group C„ is easily seen to be isomorphic 
with the '/i-fold direct product of the group Ci with itself. 
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It is a considerable saving in notation in dealing with C„ to omit the 
symbol "a" and write only the subscripts. In this notation for example, 
the elements of Ca are /, 1, 2, 3, 4, 12, 13, 14, 23, 24, 34, 123, 124, 134, 
234, 1234. The product of two or more elements of Cn can readily be 
written down. Its symbol consists of those numerals that occur an odd 
number of times in the collection of numerals that comprise the sym- 
bols of the factors. Thus, (12){234)(123) = 24. 

The isomorphism between C„ and B„ can be estabhshed in many ways. 
The most convenient way, perhaps, is to associate with the element 
iiiiiz ■ ■■ ikof Cn the element of B,, that has ones in places ii , t2 , ■ ■ ■ , 4 
and zeros in the remaining n — k places. For example, one can associate 
124 of d with 1101 of ^4 ; 14 with 1001, etc. In fact, the numeral no- 
tation afforded by this isomorphism is a much neater notation for S„ 
than is afforded by the awkward strings of zeros and ones. There are, 
of course, other ways in which elements of C„ can be paired with elements 
of Bn so that group multiplication is preserved. The collection of all such 
"pairings" makes up the group of automorphisms of C„ . This group of 
automorphisms of C„ is isomorphic with the group of non-singular hnear 
homogenous transformations in a field of characteristic 2. 

An element T of C„ is said to be dependent upon the set of elements 
Ti, T-i, ■ ■ ■ , Tj of C„ if T can be expressed as a product of some ele- 
ments of the set Ti , Ta , ■ • • , Ty ; otherwise, T is said to be independent 
of the set. A set of elements is said to be independent if no member can 
be expressed solely in terms of the other members of the set. For example, 
in Ca , 1, 2, 3, 4 form a set of independent elements as do likewise 2357, 
12357, 14. However, 135 depends upon 145, 3457, 57 since 135 = 
(145)(3457)(57). Clearly any set of n independent elements of C„ can 
be taken as generators for the group. For example, all possible products 
formed of 12, 123, and 23 yield the elements of Cs . 

Any k independent elements of C„ serve as generators for a subgroup 
of order 2\ The subgroup so generated is clearly isomorphic with Ct . 
All subgroups of C„ of order 2* can be obtained in this way. 

The number of ways in which k independent elements can be chosen 
from the 2" elements of C„ is 

F(n, k) = (2" - 2'')(2'' - 2')(2" - 2^) ■ ■ • (2" - 2'=-) 

For, the first element can be chosen in 2" - 1 ways (the identity cannot 
be included in a non-trivial set of independent elements) and the second 
element can be chosen in 2" - 2 ways. These two elements determine a 
subgroup of order 2\ The third element can be chosen as any element of 
the remaining 2" - 2^ elements. The 3 elements chosen determine a 
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subgroup of order 2^ A fourth independent element can be chosen as 
any of the remaining 2" — 2' elements, etc. 

Each set of k independent elements serves to generate a subgroup of 
order 2*. The quantit)' F{n, k) is not, however, the number of distinct 
subgroups of C„ of this order, for, a given subgroup can be obtained 
from many different sets of generators. Indeed, the number of different 
sets of generators that can generate a given subgroup of order 2*" of C„ 
is just F{k, k) since any such subgroup is isomorphic with Ck . Therefore 
the number of subgroups of C,, of order 2*" is N(n, k) = F{n, k)/F(k, k) 
which is (2). A simple calculation gives N(n, k) = N{n, n — k). 

2.2 PROOF OF PROPOSITIONS 1 AND 2 

After an element A of B„ has been presented for transmission over 
a noisy binary channel, an element T of Bn is produced at the channel 
output. The element U = AT of B^ serves as a record of the noise 
during the transmission. U is an «-place binary sequence with a one at 
each place altered in A by the noise. The channel output, T, is obtained 
from the input A by multiplication by U:T = UA. For channels of the 
sort under consideration here, the probability that U be any particular 
element of B^ of weight w is p*"?"'*". 

Consider now signaling with a particular (n, /(;)-alphabet and consider 
the standard array (4) of the alphabet. If the detection scheme (8) is 
used, a transmitted letter ^4^ will be produced without error if and only 
if the received symbol is of the form SjAi . That is, there will be no 
error only if the noise in the channel during the transmission of Ai is 
represented by one of the coset leaders. (This applies for i = 1, 2, ■ • • , 
n = 2''). The probability of this event is Qi (Proposition 1, Section 1.6). 
The convention (5) makes Qi as large as is possible for the given alpha- 
bet. 

Let X refer to transmitted letters and let Y refer to letters produced 
by the detector. We use a vertical bar to denote conditions when writing 
probabilities. The quantity to the right of the bar is the condition. We 
suppose the letters of the alphabet to be chosen independently with 
equal probability 2~'. 

The equivocation h{X \ Y) obtained when using an (n, A;)-alphabet 
with the detector (8) can most easily be computed from the formula 

h(X I Y) = h{X) - HY) + h(Y I X) (10) 

The entropy of the source is /i(X) = A;/" bits per symbol. The probability 
that the detector produce Aj when .4,- was sent is the probability that 
the noise be represented by AiAjSt , ^ = 1, 2, ■ ■ ■ , y. In symbols, 
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Pr{Y ^ A J I X ^ .4,) - Z P>iN -^ AiAjS:) - QiA^A,) 

where Q(Ai) is the sum of the probabilities of the elements that are hi 
the same column as Ai in the standard array. Therefore 

Pr{Y -^ Aj) ^'EPriY^A.lX^ Ai)Pr{X -^ A,) = ^ E QiAAj) 

= i sinc'e E QUiA,) = E QUd = 1. 

This last follows from the group property of the alphabet. Therefore 

k{Y) = -- E MY ^ Aj) log Pr{Y -> Aj) = - bits/symbol. 
n n 

Tt follows then from (10) that 

h{X\ Y) = hiY\X) 

The computation of h(Y | X) follows readily from its definition 

HY I X) = E P'iX -^ Adh{Y I X -> A,) 

= -E ^KX -> ^.)FKI^ ^Aj\X^ A,) 

log Pr(Y -^ Aj \ X -^ Ai) 
= -i E E Pr(N -^ASiAj) log E ^KiV ^ A^'„^,) 

= —ZQUiAjUogQiA^A,) 
£• a 

= - 'LQiAi) log Q{Ad 

i 

Each letter is n binary places. Proposition 2, then follows. 

2.3 DISTANCE AND THE PROOF OF THEOREM 1 

Let A and B be two elements of B„ . We define the distance, d{A, B), 
between A and B to be the weight of their product, 

d{A,B) = w(AB) (11) 

The distance between A and B is the number of places in which A and 
B differ and is just the "Hamming distance." ' In terms of the ;i-cube, 
d{A, B) is the minimum number of edges that must be traversed to go 
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from vertex A to vertex B. The distance so defined is a monotone func- 
tion of the Euclidean distance between vertices. 

It fttllows from (1 1 ) that if (' is any element of /i„ then 

d{A,B) =diAC,BC) (12) 

This fact shows the detection scheme (8) to be a maximum hkelihood 
detector. By definition of a standard array, one has 

d(Si , /) ^ d(SiAj , I) for all / and j 

The coset leaders were chosen to make this true. From (12), 

d(Si , I) = diSiA^Si , I A„,Si) - d(SiA„, , A^) 

d(S,Aj , I) = diSAjSiA^ , I SiA^) = d{A^A,., , S^AJ 

= diSiA^,At) 

where At = AjA^. Substituting these expressions in the inequality 
above yields 

d{SiA„ , A^) ^ d(SiA^', At) for all i, m, I 

This equation says that an arbitrary element in the array (4) is at least 
as close to the element at the top of its column as it is to any other letter 
of the alphabet. This is the maximum likelihood property. 

2.4 PROOF OF THEOREM 2 

Again consider an (/i, A-O-alphabet as a set of vertices of the unit n-cube. 
Consider also n mutually perpendicular hyperplanes through the cen- 
troid of the cube parallel to the coordinate planes. We call these planes 
"symnictry planes of the cube" and suppose the planes numbered in 
accordance with the corresponding parallel coordinate planes. 

The reflection of the vertex with coordinates (ai , ao , ■ ■ ■ , a,- , ■ ■ ■ , a^ 
in symmetry plane i yields the vertex of the cube whose coordinates 
are (fli , 02 , ■ • ■ , «i -j- 1, • • • , On). 3\Iore generally, reflecting a given 
vertex successively in symmetry planes «, j, A;, ■ ■ ■ yields a new vertex 
whose coordinates differ from the original vertex precisely in places 
i, j, k ' ' ' • Successive reflections in hyperplanes constitute a transfor- 
mation that leaves distances between points unaltered and is therefore 
a "rotation." The rotation obtained by reflecting successively in sym- 
metry planes i,j, k, etc. can be represented by an n-place symbol having 
a one in places ?', j, k, etc. and a zero elsewhere. 

We now regard a given (n, fc)-alphabet as generated by operating on 
the vertex (0, 0, ■ ■ ■ , 0) of the cube with a certain collection of 2 ro- 
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tation operators. The symbols for these operators are identical with the 
sequences of zeros and ones that form the coordinates of the 2 points. 
It is readily seen that these rotation operators form a group which is 
transitive on the letters of the alphabet and which leave the unit cube 
invariant. Theorem 2 then follows. 

Theorem 2 also follows readily from consideration of the array (4). 
For example, the maximum likelihood region associated with / is the 
set of points f,S2,Si,--',Sy. The maximum likelihood region asso- 
ciated with A i is the set of points At , AiSo , A iS^ , ■ ■ • , A iS, . The 
rotation (successive reflections in symmetry planes of the cube) whose 
symbol is the same as the coordinate sequence of Ai sends the maximum 
likelihood region of I into the maximum likelihood region oi Ai, i = 
1,2, ■•- ,n. 

2.5 PROOF OF THEOREM 3 

That every systematic alphabet is a group alphabet follows trivially 
from the fact that the sum mod 2 of two letters satisfying parity checks 
is again a letter satisfying the parity checks. The totality of letters satis- 
fying given parity checks thus constitutes a finite group. 

To prove that every group alphabet is a systematic code, consider 
the letters of a given (n, A;)-alphabet listed in a column. One obtains in 
this way a matrix with 2''' rows and n columns whose entries are zeros 
and ones. Because the rows are distinct and form a group isomorphic to 
Ck , there are k linearly independent rows (mod 2) and no set of more 
than k independent rows. The rank of the matrix is therefore k. The 
matrix therefore possesses k linearly independent (mod 2) columns and 
the remaining n — k columns are linear combinations of these k. Main- 
taining only these k linearly independent columns, we obtain a matrix of 
k columns and 2* rows with rank k. This matrix must, therefore, have k 
linearly independent rows. The rows, however, form a group under mod 
2 addition and hence, since k are linearly independent, all 2* rows must 
be distinct. The matrix contains only zeros and ones as entries; it has 2 
distinct rows of k entries each. The matrix must be a listing of the num- 
bers from to 2*" — 1 in binary notation. The other n — k columns of 
the original matrix considered are linear combinations of the columns of 
this matrix. This completes the proof of Theorem 3 and Proposition 4. 

2.6 PROOF OF THEOREM 4 

To prove Theorem 4 we first note that the parity check sequence of 
the product of two elements of 5„ is the mod 2 sum of their separate 
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parity check sequences. It follows then that all elements in a given coset 
have the same parity check sequence. For, let the coset be St , SiA2 , 
SiAz , ' ■ ■ SiAy, . Since the elements 7, ^42 , ^43 , ■ • • , A^ all have parity 
check sequence 00 ■ ■ ■ 0, all elements of the coset have parity check 
fl(5.). 

In the array (4) there are 2" *■ cosets. We observe that there are 2" * 
elements of B„ that have zeros in their first k places. These elements 
have parity check symbols identical with the last n — k places of their 
symbols. These elements therefore give rise to 2""*" different parity check 
symbols. The elements must be distributed one per coset. This proves 
Theorem 4. 

2.7 PROOF OF PROPOSITION 5 

If 

.S2.-'-("-*)-f-^)-l 

we can explicity exliibit group alphabets having the property mentioned 
in the paragraph preceding Proposition 5. The notation of the demon- 
stration is cumbersome, but the idea is relatively simple. 

We shall use the notation of paragraph 2.1 for elements of J5„ , i.e., 
an element of B^ will be given by a list of integers that specify what 
places of the sequence for the element contain ones. It will be convenient 
furthermore to designate the first k places of a sequence by the integers 
1, 2, 3, • • • ,k and the remaining n — k places by the "integers" 1', 2', 
3', ■ • ■ , (' , where C = ii — k. For example, if ti = 8, fc = 5, we have 

10111010^ 13452' 
10000100*^ 11' 
00000101 ^ 1'3' 

Consider the group generated by the elements 1', 2', 3', ■ ■ ■ , (-', i.e. 
the 2' elements /, 1', 2', • - ■ , (', 1'2', 1'3', • • ■ , r2'3' ■■•('. Suppose 
these elements listed according to decreasing weight (say in decreasing 
order when regarded as numbers in the decimal system) and numbered 
consecutively. Let Bi be the iih element in the list. Example: if / = 3, 
B, = 1'2'3', Bi = 2'3', B3 = 1'3', B^ - 1'2', B5 = 3', B^ = 2', B, = 1'. 

Consider now the (n, A^-alphabet Avhose generators are 

151,2^2,3^3, ■■■ ,kBt 
We assert that if 
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this alphabet is as good as any other alphabet of 2*^ letters and n places. 

In the first place, we observe that every letter of this (n, ^)-alphabet 
(except /) has unprimed numbers in its symbols. It follows that each of 
the 2' letters /, 1', 2', • • • , f", 1'2', • • ■ , 1'2' - ■ ■ T occurs in a different 
coset of the given {n, fc)-alphabet. For, if two of these letters appeared 
in the same coset, their product (which contains only primed numbers) 
would have to be a letter of the (n, k) alphabet. This is impossible since 
every letter of the (n, k) alphabet has unprimed numbers in its symbol. 
Since there are precisely 2* cosets we can designate a coset by the single 
element of the list Bi, Bi , ■ ■ ■ , B^e = I which appears in the coset. 

We next observe that the condition 

guarantees that B^+i is of weight 3 or less. For, the given condition is 
equivalent to 

'-^'-(o)-(i 

We treat several cases depending on the weight of B^+i . 

If Bh+i is of weight 3, we note that for i = 1,2, ■ ■ ■ , k, the coset con- 
taining Bi also contains an element of weight one, namely the element 
i obtained as the product of Bi with the letter iBi of the given (n, k)- 
alphabet. Of the remaining (2' — k) B's, one Is of weight zero, t are of 

weight one, ( ) are of weight 2 and the remaining are of weight 3. We 

have, then a^ ^ 1, on = f -\- k = n. Now every B of weight 4 occurs in 
the list of generators \Bi , 2B-i , ■ ■ ■ , kBk . It follows that on multi- 
plying this list of generators by any B of weight 3, at least one element 
of weight two will result. (E.g., (l'2'3')(il'2'3'-I') = ^4') Thus every 
coset with a ^ of weight 2 or 3 contains an element of weight 2 and 

o;^ = 2 — ao — oil , 

The argument in case Bk+i is of weight two or one is similar. 

2.8 MODULAR REPRESENTATIONS OF Cn 

In order to explain one of the methods used to obtain the best (n, k)- 
alphabets listed in Tables II and III, it is necessary to digress here to 
present additional theory. 
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It has been remarked that every (n, /i;)-alphabet is isomorphic with 
Ck . Let us suppose the elements of Ck hsted in a column starting with / 
and proceeding in order 7, 1, 2, 3, ■ ■ • , /.:, 12, 13, ■ ■ ■ , (A; - 1)/.:, 123, 

, 123 ■ ■ ■ k. The elements of a given (n, A:)-alphabet can be 

paired off with these abstract elements so as to preserve group multipli- 
cation. This can be done in many different ways. The result is a matrix 
with elements zero and one with n columns and 2*" rows, these latter 
being labelled by the symbols 7, 1,2, ■ ■ ■ etc. What can be said about 
the columns of this matrix? How many different columns are possible 
when all (n, /i)-alphabets and all methods of establishing isomorphism 
with Ck are considered? 

In a given column, once the entries in rows 1,2, ■ ■ ■ , k are known, the 
entire column is determined by the group property. There are therefore 
only 2 possible different columns for such a matrix. A table showing 
these 2 possible columns of zeros and ones will be called a modular re-pre- 
sentation table for Ck . An example of such a table is shown for A; — 4 in 
Table VI. 

It is clear that the columns of a modular representation table can also 
be labelled by the elements of Ck , and that group multiplication of these 
column labels is isomorphic with mod 2 addition of the columns. The 
table is a symmetric matrix. The element with row label A and column 
label B is one if the symbols A and B have an odd number of diffei'eut 
numerals in common and is zero otherwise. 

Every (n, /^-alphabet can be made from a modular representation 
table by choosing n columns of the table (with possible repetitions) at 
least k of which form an independent set. 



Table VI - — Modular Repbesentation Table for Group C4 

I 12 3 4 !2 13 14 23 24 34 123 124 134 23+ 1234 



I 


















































1 





1 











1 




1 











1 




1 





1 


2 








1 








1 







1 


1 





1 







1 


1 


3 











1 













1 





1 


1 




1 


1 


1 


4 














1 







1 







1 







1 


1 


1 


12 





1 


1 













1 


1 












1 


1 





13 





1 





1 





1 




1 


1 





1 










1 





14 





1 








] 


1 












1 


1 







1 





23 








1 


1 





1 












1 







1 








24 








1 





1 


1 




1 


1 




1 


1 




1 








34 











1 


1 







1 


1 







1 










u 


123 





1 


1 


1 










1 







1 


1 










1 


124 





1 


1 





1 










1 





1 













1 


134 





I 





1 


1 


1 







1 


1 










1 





1 


234 








1 


1 


1 


1 




1 



















1 


1 


1234 





1 


1 


1 


1 




















1 




1 


1 






228 THE BELL SYSTEM TECHNICAL JOURNAL, JANUARY 1956 

We henceforth exclude consideration of the column / of a modular 
representation table. Its inclusion in an {n, /{;)-alphabet is clearly a waste 
of 1 binary digit. 

It is easy to show that every column of a modular representation table 
for Ck contains exactly 2*""^ ones. Since an {n, A;)-alphabet is made from 
n such columns the alphabet contains a total of n2''''^ ones and we have 

Proposition 6. The weights of an {n, fc)-alphabet form a partition of 
n2''~^ into 2*^ — 1 non-zero parts, each part being an integer from the set 
1, 2, ■ • ■ , n. 
The identity element always has weight zero, of course. 

It is readily established that the product of two elements of even 
weight is again an element of even weight as is the product of two ele- 
ments of odd weight. The product of an element of even weight with an 
element of odd weight yields an element of odd weight. 

The elements of even weight of an {n, fc)-alphabet form a subgroup 
and the preceding argument shows that this subgroup must be of order 
2*" or 2''""'. If the group of even elements is of order 2''~\ then the collec- 
tion of even elements is a possible (ti, k — l)-alphabet. This (n, k — 1) 
alphabet may, however, contain the column / of the modular represen- 
tation table of Cfc-i . We therefore have 

Pro-position 7. The partition of Proposition 6 must be either into 
2*^—1 even parts or else into 2''"^ odd parts and 2''"^ — 1 even parts. 
In the latter case, the oven parts form a partition of o;2' " where a is 
some integer of the set /c — 1, fc, - ■ ■ ,n and each of the parts is an in- 
teger from the set 1, 2, • • ■ ,n. 

2.9 THE CHARACTERS OF Ck 

Let us replace the elements of B„ (each of which is a sequence of zeros 
and ones) by sequences of -f-l's and — I's by means of the following 
substitution 

\Z\ . (-) 

The multiphcative properties of elements of B„ can be preserved in this 
new notation if we define the product of two -1-1, —1 symbols to be the 
symbol whose ith component is the ordinary product of the t'th compo- 
nents of the two factors. For example, 1011 and 0110 become respectively 
-11 -1 -1 and 1 -1 -U. We have 

(-11 -1 -1)(1 -1 -11) = (-1 -11 -1) 
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-1 



A CLASS OF BINARY SIGNALING ALPHABETS 229 

corresponding to the fact that 

(1011) (0110) = (1101) 

If the +1,-1 symbols are regarded as shorthand for diagonal matrices, 
so that for example 



-11 -1 -1^ 



then group multiplication corresponds to matrix multiphcation. 

(While much of what follows here can be established in an elementary 
way for the simple group at hand, it is convenient to fall back upon the 
established general theory of group representations^ for several proposi- 
tions. 

The substitution (13) converts a modular representation table (col- 
umn / included) into a square array of +i's and — I's. Each column (or 
row) of this array is clearly an irreducible representation of Ch ■ Since Ck 
is Abcliau it has precisely 2 irreducible representations each of degree 
one. These are furnished by the converted modular table. This table also 
furnishes then the characters of the irreducible representations of Ck 
and we refer to it henceforth as a character (able. 

Let x'*(^l) be the entry of the character table in the row labelled A and 
column labelled a. The orthogonality relationship for characters gives 

E x''(A)x'(A) = 2%, 
E x"(A)x"(-B) = 2%n 

where 8 is the usual Kronecker symbol. In particular 

E x'iA)x\A) = E AA) =0, ^^I 

A<ZCk ACCk 

Since each x (A) is + 1 or — 1, these must occur in equal numbers in any 
column /3 5^ /. This implies that each column except 7 of the modular 
representation table contains 2 ~^ ones, a fact used earlier. 

Evoiy matrix representation of Ck can be reduced to its irreducible 
components. If the trace of the matrix representing the element A in an 
arlsitrary matrix representation of Ck is xiA), then this representation 
contains the irreducible representation having label in the character 
table dp times where 
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<^^ = h Z x(A)x'(A) (14) 

Every {n, fc)-alphabet furnishes us with a matrix representation of C*- 
by means of (13) and the procedure outlined below (13). The trace x(^) 
of the matrix representing the element A of Ck is related to the weight 

of the letter by ■ 

x{A) =n^ 2w{A) (15) 

Equations (14) and (15) permit us to compute from the weights of an 
{n, /i)-alphabet what irreducible representations are present in the alpha- 
bet and how many times each is contained. It is assumed here that the 
given alphabet has been made isomorphic to Ch and that the weights are 
labelled by elements of Ck . 

Consider the converse problem. Given a set of numbers Wi .w^ , • ■ ■ , 
wik that satisfy Propositions 6 and 7. From these we can compute 
quantities Xi ^ 'n — 2Wi as in (15). It is clear that the given w's will 
constitute the weights of an (n, fc)-alphabet if and only if the 2*" Xi can 
be labelled with elements of Ck so that the 2^ sums (14) (/3 ranges over 
all elements of Ck) are non-negative integers. The integers d^ tell what 
representations to choose to construct an (n, A:) -alphabet with the given 
weights Wi . ' 

2.10 CONSTRUCTION OF BEST ALPHABETS 

A great many different techniques were used to construct the group 
alphabets listed in Tables II and III and to show that for each n and k 
there are no group alphabets with smaller probability of error. Space 
prohibits the exhibition of proofs for all the alphabets listed. We content 
ourselves here with a sample argument and treat the case «■ — 10, fc = 
4 in detail. 

According to (2) there are iV(10, 4) - 53,743,987 different (10, 4)- 
alphabets. We now show that none is better than the one given in Table 
III. The letters of this alphabet and weights of the letters are 

I 

1 6 7 8 10 5 

2 6 7 9 10 5 

3 5 68910 6 

4 578910 6 
1289 4 
13579 5 
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14569 

2 3 5 7 8 
2 4 5 8 
3467 

1 23579 
12457 10 
13 48 10 

2 3 4 9 10 
12346789 



The notation is that of Section 2.1. By actually forming the standard 
array of this alphabet, it is verified that 

aa — 1, ai = 10, 0:2 ^ 39, 03 ^ 14. 



Table II shows 



C") = 



= 45, whereas 02 = 39, so the given alphabet 



does not correct all possible double errors. In the standard array for the 
alphabet, 39 coset leaders are of weight 2. Of these 39 cosets, 33 have 
only one element of weight 2; the remaining 6 cosets each contain two 
elements of weight 2. This is due to the two elements of weight 4 in the 
gi\'en group, namely 1289 and 3467. A portion of the standard array 
that demonstrates these points is 



1289 



3467 



■ 
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89 
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29 




19 


28 




34 




67 


36 
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47 


37 
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46 



In order to have a smaller probability of error than the exhibited 
alphabet, it is necessary that a (10, 4)-alphabet have an a2 > 39. We 
proceed to show that this is impossible by consideration of the weights 
of the letters of possible (10, 4)-alphabets. 

We first show that every (10, 4)-alphabet must have at least one ele- 
ment (other than the identity, /) of weight less than o. By Propositions 
() and 7, Section2.8, the weights must form a paiiition of 10-8 = 80 into 
15 positive parts. If the weights are all even, at least two must be less 
than 6 since 14 -6 = 84 > 80. If eight of the weights are odd, we see from 
8-5 + 7-6 = 82 > 80 that at least one weight must be less than 5. 
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All alphabet with one or more elements of weight 1 must have an 
cii ^ 36, for there are nine elements of weight 2 which cannot possilily 
be coset leaders. To see this, suppose (without loss of gencrahty) that 
the alphabet contains the letter 1. The elements 12, 13, 14, ■ ■ ■ 1 10 can- 
not possibly be coset leaders since the product of any one of them with 
the letter 1 yields an element of weight 1. 

An alphabet with one or more elements of weight 2 must have an 
ai g 37. Suppose for example, the alphabet contained the letter 12. 
Then 13 and 23 must be in the same coset, 14 and 24 must be in the 
same coset, ■ ■ , 1 10 and 2 10 must be in the same coset. There are at 
least eight elements of weight two which are not coset leaders. 

Each element of weight 3 in the alphabet prevents three elements of 
weight 2 from being coset leaders. For example, if the alphabet contains 
123, then 12, 13, and 23 cannot be eoset leaders. We say that the three 
elements of weight 2 are "blocked" by the letter of weight 3. Suppose an 
alphabet contains at least three letters of weight three. There are several 
cases: (A) if three letters have no numerals in common, e.g., 123, 456, 
789, then nine distinct elements of weight 2 are blocked and m ^ 30; 
(B) if no two of the letters have more than a single numeral in common, 
e.g., 123, 345, 789, then again nine elements of weight 2 are blocked and 
at ^ 36 ; and (C) if two of the letters of weight 3 have two numerals in 
common, e.g., 123, 234, then their product is a letter of weight 2 and by 
the preceding paragraph ai % 37. If an alphabet contains e.xactly two 
elements of weight 3 and no elements of weight 2, the elements of weight 

3 block six elements of weight 2 and ai ^ 39. 

The preceding argument shows that to be better than the exhibited 
alphabet a (10, 4)-alphabet with letters of weight 3 must have just one 
such letter. A similar argument (omitted here) shows that to be better 
than the exhibited alphabet, a (10, 4)-alphabet cannot contain more 
than one element of weight 4. Furthermore, it is easily seen that an 
alphabet contahiing one element of weight 3 and one element of weight 

4 must have an 0:2 ^ 39. ■ 

The only new contenders for best (10, 4)-alphabet are, therefore, 
alphabets with a single letter other than I of weight less than 5, and this 
letter must have weight 3 or 4. Application of Propositions and 7 show 
that the only possible weights for alphabets of this sort are: 35 6 and 
5^46" where 5^ means seven letters of weight 5, etc. We next show that 
there do not exist (10, 4)-alphabets having these weights. 

Consider first the suggested alphabet with weights 35 6 . As explained 
in Section 2.9, from such an alphabet we can construct a matrix repre- 
sentation of Ci having the character x(^) =^ 10, one matrix of trace 4, 
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seven of trace and seven of trace —2. The latter seven matrices cor- 
respond to elements of even weight and together with / must represent 
a subgroup of order 8. We associate them with the subgroup generated 
by the elements 2, 3, and 4. We have therefore 

X(/) = 10, x(2) = x(3) = x(4) = x(23) 

= x(24) = x(34) = x(234) = -2. 

Examination of the symmetries involved shows that it doesn't matter 
how the remaining xi are associated with the remaining group elements. 
We take, for example 

x(l) = 4, x(12) = xim =- x(14) = x{123) 

= x(124) = x(134) = x(1234) = 0. 

Now form the sum shown in equation (14) with /5 ^ 1234 (i.e., with the 
character x'"^* obtahied from column 1234 of the Table VI by means 
of substitution (13). There results dm* = \i which is impossible. There- 
fore there does not exist a (10, 4)-alphabet with w^eights 35 6 . 

The weights 5^40^ correspond to a representation of C4 with character 
x(/) = 10, 0^ 2, ( — 2)\ We take the subgroup of elements of even weight 
to be generated by 2, 3, and 4. Except for the identity, it is clearly im- 
material to which of these elements we assign the character 2. We make 
the following assignment: x(-f) = 10, x(2) = 2, x(3) = x(4) - x(23) = 
X(24) - x(34) - x(234) ^ -2, x(l) = x(12) = x(13) - x(14) = 
x(123J - x(124) = x(134) = x(1234) = 0. The use of equation (14) 
shows that d-i = Yz which is impossible. 

It follows that of the 53,743,987 (10, 4)-alphabets, none is better than 
the one listed on Table III. 

Not all the entries of Table III were established in the manner just 
demonstrated for the (10, 4)-a]phabet. In many cases the search for a 
best alphabet was narrowed down to a few alphabets by simple argu- 
ments. The standard arrays for the alphabets w^ere constructed and the 
best alphabet chosen. For large n the labor in making such a table can 
be considerable and the operations involved are highly liable to error 
when performed by hand. 

I am deeply indebted to V. M. Wolontis who programmed the IBM 
CPC computer to determine the a's of a given alphabet and who pa- 
tiently ran off many such alphabets in course of the construction of 
Tables II and III. I am also indebted to Mrs. D. R. Fursdon who eval- 
uated many of the smaller alphabets by hand. 
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